Hauppauge
Mathematical Modeling and Convergence Analysis of Deep Neural Networks with Dense Layer Connectivities in Deep Learning
Huang, Jinshu, Su, Haibin, Tai, Xue-Cheng, Wu, Chunlin
In deep learning, dense layer connectivity has become a key design principle in deep neural networks (DNNs), enabling efficient information flow and strong performance across a range of applications. In this work, we model densely connected DNNs mathematically and analyze their learning problems in the deep-layer limit. For a broad applicability, we present our analysis in a framework setting of DNNs with densely connected layers and general non-local feature transformations (with local feature transformations as special cases) within layers, which is called dense non-local (DNL) framework and includes standard DenseNets and variants as special examples. In this formulation, the densely connected networks are modeled as nonlinear integral equations, in contrast to the ordinary differential equation viewpoint commonly adopted in prior works. We study the associated training problems from an optimal control perspective and prove convergence results from the network learning problem to its continuous-time counterpart. In particular, we show the convergence of optimal values and the subsequence convergence of minimizers, using a piecewise linear extension and $Γ$-convergence analysis. Our results provide a mathematical foundation for understanding densely connected DNNs and further suggest that such architectures can offer stability of training deep models.
- Asia > China > Tianjin Province > Tianjin (0.04)
- North America > United States > Rhode Island > Providence County > Providence (0.04)
- North America > United States > New York > Suffolk County > Hauppauge (0.04)
- (7 more...)
- Health & Medicine (0.46)
- Education > Focused Education > Special Education (0.45)
Non-Gaussianities in Collider Metric Binning
Metrics for rigorously defining a distance between two events have been used to study the properties of the dataspace manifold of particle collider physics. The probability distribution of pairwise distances on this dataspace is unique with probability 1, and so this suggests a method to search for and identify new physics by the deviation of measurement from a null hypothesis prediction. To quantify the deviation statistically, we directly calculate the probability distribution of the number of event pairs that land in the bin a fixed distance apart. This distribution is not generically Gaussian and the ratio of the standard deviation to the mean entries in a bin scales inversely with the square-root of the number of events in the data ensemble. If the dataspace manifold exhibits some enhanced symmetry, the number of entries is Gaussian, and further fluctuations about the mean scale away like the inverse of the number of events. We define a robust measure of the non-Gaussianity of the bin-by-bin statistics of the distance distribution, and demonstrate in simulated data of jets from quantum chromodynamics sensitivity to the parton-to-hadron transition and that the manifold of events enjoys enhanced symmetries as their energy increases.
- North America > United States > New York > Suffolk County > Hauppauge (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
CueTip: An Interactive and Explainable Physics-aware Pool Assistant
Memery, Sean, Denamganai, Kevin, Zhang, Jiaxin, Tu, Zehai, Guo, Yiwen, Subr, Kartic
We present an interactive and explainable automated coaching assistant called CueTip for a variant of pool/billiards. CueTip's novelty lies in its combination of three features: a natural-language interface, an ability to perform contextual, physics-aware reasoning, and that its explanations are rooted in a set of predetermined guidelines developed by domain experts. We instrument a physics simulator so that it generates event traces in natural language alongside traditional state traces. Event traces lend themselves to interpretation by language models, which serve as the interface to our assistant. We design and train a neural adaptor that decouples tactical choices made by CueTip from its interactivity and explainability allowing it to be reconfigured to mimic any pool playing agent. Our experiments show that CueTip enables contextual query-based assistance and explanations while maintaining the strength of the agent in terms of win rate (improving it in some situations). The explanations generated by CueTip are physically-aware and grounded in the expert rules and are therefore more reliable.
- Europe > United Kingdom > Scotland > City of Edinburgh > Edinburgh (0.04)
- Europe > United Kingdom > England > Greater London > London (0.04)
- North America > United States > New York > Suffolk County > Hauppauge (0.04)
- (3 more...)
- Leisure & Entertainment > Games (0.67)
- Health & Medicine (0.67)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- (2 more...)
A Step Toward Interpretability: Smearing the Likelihood
The problem of interpretability of machine learning architecture in particle physics has no agreed-upon definition, much less any proposed solution. We present a first modest step toward these goals by proposing a definition and corresponding practical method for isolation and identification of relevant physical energy scales exploited by the machine. This is accomplished by smearing or averaging over all input events that lie within a prescribed metric energy distance of one another and correspondingly renders any quantity measured on a finite, discrete dataset continuous over the dataspace. Within this approach, we are able to explicitly demonstrate that (approximate) scaling laws are a consequence of extreme value theory applied to analysis of the distribution of the irreducible minimal distance over which a machine must extrapolate given a finite dataset. As an example, we study quark versus gluon jet identification, construct the smeared likelihood, and show that discrimination power steadily increases as resolution decreases, indicating that the true likelihood for the problem is sensitive to emissions at all scales.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- North America > United States > New York > Suffolk County > Hauppauge (0.04)
- Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)
CAUS: A Dataset for Question Generation based on Human Cognition Leveraging Large Language Models
Shin, Minjung, Kim, Donghyun, Ryu, Jeh-Kwang
We introduce the Curious About Uncertain Scene (CAUS) dataset, designed to enable Large Language Models, specifically GPT-4, to emulate human cognitive processes for resolving uncertainties. Leveraging this dataset, we investigate the potential of LLMs to engage in questioning effectively. Our approach involves providing scene descriptions embedded with uncertainties to stimulate the generation of reasoning and queries. The queries are then classified according to multi-dimensional criteria. All procedures are facilitated by a collaborative system involving both LLMs and human researchers. Our results demonstrate that GPT-4 can effectively generate pertinent questions and grasp their nuances, particularly when given appropriate context and instructions. The study suggests that incorporating human-like questioning into AI models improves their ability to manage uncertainties, paving the way for future advancements in Artificial Intelligence (AI).
- Asia > South Korea > Seoul > Seoul (0.05)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Washington > King County > Seattle (0.04)
- (4 more...)
Creating Large Language Model Resistant Exams: Guidelines and Strategies
The proliferation of Large Language Models (LLMs), such as ChatGPT, has raised concerns about their potential impact on academic integrity, prompting the need for LLM-resistant exam designs. This article investigates the performance of LLMs on exams and their implications for assessment, focusing on ChatGPT's abilities and limitations. We propose guidelines for creating LLM-resistant exams, including content moderation, deliberate inaccuracies, real-world scenarios beyond the model's knowledge base, effective distractor options, evaluating soft skills, and incorporating non-textual information. The article also highlights the significance of adapting assessments to modern tools and promoting essential skills development in students. By adopting these strategies, educators can maintain academic integrity while ensuring that assessments accurately reflect contemporary professional settings and address the challenges and opportunities posed by artificial intelligence in education.
- Europe > Spain (0.05)
- North America > United States > New York > Suffolk County > Hauppauge (0.04)
- North America > Mexico (0.04)
- North America > Canada (0.04)
Computational design of antimicrobial active surfaces via automated Bayesian optimization
Biofilms pose significant problems for engineers in diverse fields, such as marine science, bioenergy, and biomedicine, where effective biofilm control is a long-term goal. The adhesion and surface mechanics of biofilms play crucial roles in generating and removing biofilm. Designing customized nano-surfaces with different surface topologies can alter the adhesive properties to remove biofilms more easily and greatly improve long-term biofilm control. To rapidly design such topologies, we employ individual-based modeling and Bayesian optimization to automate the design process and generate different active surfaces for effective biofilm removal. Our framework successfully generated ideal nano-surfaces for biofilm removal through applied shear and vibration. Densely distributed short pillar topography is the optimal geometry to prevent biofilm formation. Under fluidic shearing, the optimal topography is to sparsely distribute tall, slim, pillar-like structures. When subjected to either vertical or lateral vibrations, thick trapezoidal cones are found to be optimal. Optimizing the vibrational loading indicates a small vibration magnitude with relatively low frequencies is more efficient in removing biofilm. Our results provide insights into various engineering fields that require surface-mediated biofilm control. Our framework can also be applied to more general materials design and optimization.
- North America > United States > New York > Tompkins County > Ithaca (0.04)
- North America > United States > New York > Suffolk County > Hauppauge (0.04)
SPX Announces Purchase of ULC Robotics
SPX Corporation announced that it has acquired ULC Robotics, a leading developer of robotic systems, machine learning applications and inspection technology for the energy, utility and industrial sectors. A pioneer in the field of Robotics-as-a-Service (RaaS), ULC operates a growing, recurring-revenue business called CISBOT which uses robotic solutions designed to rehabilitate and extend the life of natural gas distribution networks for utility customers. ULC also operates a custom Research & Development ("R&D") business that provides cutting edge technology solutions to a growing base of utility and industrial customers. ULC is headquartered in Hauppauge, New York, with a significant presence in the United Kingdom. Its results will be reported with SPX's Location & Inspection platform within its Detection & Measurement segment.
The Unproven, Invasive Surveillance Technology Schools Are Using to Monitor Students
ProPublica is a nonprofit newsroom that investigates abuses of power. Sign up for ProPublica's Big Story newsletter to receive stories like this one in your inbox as soon as they are published. Ariella Russcol specializes in drama at the Frank Sinatra School of the Arts in Queens, New York, and the senior's performance on this April afternoon didn't disappoint. While the library is normally the quietest room in the school, her ear-piercing screams sounded more like a horror movie than study hall. But they weren't enough to set off a small microphone in the ceiling that was supposed to detect aggression.
- North America > United States > New York > Queens County > New York City (0.24)
- North America > United States > Nevada > Clark County > Las Vegas (0.05)
- North America > United States > Florida > Broward County > Parkland (0.05)
- (13 more...)
A Device to Detect 'Aggression' in Schools Often Misfires
This story was co-published with ProPublica. Ariella Russcol specializes in drama at the Frank Sinatra School of the Arts in Queens, New York, and the senior's performance on this April afternoon didn't disappoint. While the library is normally the quietest room in the school, her ear-piercing screams sounded more like a horror movie than study hall. But they weren't enough to set off a small microphone in the ceiling that was supposed to detect aggression. A few days later, at the Staples Pathways Academy in Westport, Connecticut, junior Sami D'Anna inadvertently triggered the same device with a less spooky sound--a coughing fit from a lingering chest cold.
- North America > United States > New York > Queens County > New York City (0.24)
- North America > United States > Connecticut > Fairfield County > Westport (0.24)
- North America > United States > Utah (0.05)
- (14 more...)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Health & Medicine (1.00)
- Education > Health & Safety > School Safety & Security > School Violence (0.96)
- Education > Educational Setting > K-12 Education (0.72)